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We consider a strictly stationary sequence of random vectors whose finite-dimensional distri- 
butions are jointly regularly varying with some positive index. This class of processes includes, 
among others, ARMA processes with regularly varying noise, GARCH processes with normally 
or Student-distributed noise and stochastic volatility models with regularly varying multiplica- 
tive noise. We define an analog of the autocorrelation function, the extremogram, which depends 
only on the extreme values in the sequence. We also propose a natural estimator for the ex- 
tremogram and study its asymptotic properties under Q-mixing. We show asymptotic normality, 
calculate the extremogram for various examples and consider spectral analysis related to the 
extremogram. 

Keywords: GARCH; multivariate regular variation; stationary sequence; stochastic volatility 
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1. Measures of extremal dependence in a strictly 
stationary sequence 

The motivation for this research comes from the problem of choosing between two pop- 
ular and commonly used families of models, the generalized autoregressive conditional 
hctcrosccdastic (GARCH) process and the heavy-tailed stochastic volatility (SV) process, 
for modeling a particular financial time series. Both GARCH and SV models possess the 
stylized features exhibited by log-returns of financial assets. Specifically, these time series 
have heavy-tailed marginal distributions, are dependent but uncorrelated and display 
stochastic volatility. The latter property is manifested via the often slow decay of the 
sample autocorrelation function (ACF) of the absolute values and squares of the time 
series. Since both GARCH and SV models can be chosen to have virtually identical be- 
havior in the tails of the marginal distribution and in the ACF of the squares of the 
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process, it is difficult for a given time series of returns to decide between the two models 
on the basis of routine time series diagnostic tools. 

The problem of finding probabilistically reasonable and statistically estimable measures 
of extremal dependence in a strictly stationary sequence is, to some extent, an open 
one. In classical time series analysis, which mostly deals with second order structure 
of stationary sequences, the ACF is a well-accepted object for describing meaningful 
information about serial dependence. The ACF is sometimes overvalued as a tool for 
measuring dependence, especially if one is only interested in extremes. It docs, of course, 
determine the distribution of a stationary Gaussian sequence, but for non-Gaussian and 
nonlinear time series, the ACF often provides little insight into the dependence structure 
of the process. This is particularly the case when one considers heavy-tailed nonlinear 
time series such as the GARCH model. In this case, the estimation of the ACF via the 
sample ACF is also rather imprecise and even misleading since the asymptotic confidence 
bands are typically larger than the estimated autocorrelations; see, for example, the 
results in Basrak et al. [1] for bilinear processes, Davis and Mikosch [11], Mikosch and 
Starica [26] and Basrak et al. [2] for ARCH and GARCH processes and Resnick [32] for 
teletraffic models. 

1.1. The extremal index 

The asymptotic behavior of the extremes leads to one clear difference between GARCH 
and SV processes. It was shown in Davis and Mikosch [11], Basrak et al. [2], Davis and 
Mikosch [12] (sec also Brcidt and Davis [6] for the light-tailed SV case) that GARCH 
processes exhibit extremal clustering (that is, clustering of extremes), while SV processes 
lack this form of clustering. Associated with most stationary time scries is a parameter 
6*6 (0,1], called the extremal index (see Leadbettcr et al. [24]), which is a measure of 
clustering in the extremes. For example, the extremal index 9 is less than 1 for a GARCH 
process, which is indicative of extremal clustering, while 9 = \ for SV processes, indicating 
no clustering. The parameter 9 can also be interpreted as the reciprocal of the expected 
cluster size in the limiting compound Poisson process of the weakly converging point 
processes of exceedances of (ATt); see, for example, Leadbetter et al. [24] or Embrechts et 
al. [15], Section 8.1. 

In this paper, we take a different tack and study the extremal dependence structure of 
general strictly stationary vector- valued time series (X4). Certainly, the cluster distribu- 
tion of the limiting compound Poisson process contains more useful information about 
the clustering behavior of extremes than the extremal index. Although explicit formulae 
for the extremal index and the cluster distribution exist for some specific time scries mod- 
els (including certain ARM A and GARCH models and some Markov processes), these 
expressions are, in general, very complicated to compute and even difficult to simulate. 
They are also rather difficult objects to estimate and do not always yield satisfactory 
results, even for moderate sample sizes. 
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1.2. Regularly varying time series 

In this paper, wc focus on strictly stationary sequences whose finite-dimensional dis- 
tributions have power law tails in some generalized sense. In particular, we will assume 
that the finite-dimensional distributions of the d-dimensional process (X() have regularly 
varying distributions with a positive tail index a. This means that for any /i > 1, the 
lagged vector Yh = vec(Xi , . . . , X/i) satisfies the relation 

P{\Y,\>x) -^^''^-^ ^'-'^ 

for some non-null Radon measure /i^. on Mq'' = K.'*^ \ {0}, M = M U {±00} , with the 

property that ^h{tC) ~ t^"iih{C), t > 0, for any Borel set C C Kq"*. Here, A denotes 
vague convergence; see Kalleuberg [22], Daley and Vere- Jones [9] and Resnick [30, 31, 33] 
for this notion and Resnick [30, 31, 33] and Hult and Lindskog [21] for the notion of 
multivariate regular variation. We call such a sequence (Xt) regularly varying with index 
a>0. 

Various time series models of interest are regularly varying. These include infinite vari- 
ance stable processes, ARMA processes with i.i.d. regularly varying noise, GARCH pro- 
cesses with i.i.d. noise with infinite support (including normally and Student-distributed 
noise) and stochastic volatility models with i.i.d. regularly varying noise. In Section 2, 
we will be more precise about the regular variation of the aforementioned sequences. 
It follows from general multivariate extreme value theory (e.g., Resnick [31]) that any 
strictly stationary time series whose finite-dimensional distributions are in the maximum 
domain of attraction of a multivariate extreme value distribution can be transformed 
to a regularly varying strictly stationary time series. This can be simply achieved by a 
monotone transformation of the marginal distribution. Hence, the results of this paper 
apply in a more general framework than that of regularly varying sequences. 

For our purposes, it will be convenient to use a sequential definition of a regularly 
varying sequence (X^) which is equivalent to the definition above. Consider a sequence 
a„ 1 00 such that -P(|X| > a„) ^ n~^. Then, (1.1) holds if and only if there exist constants 
c/i > such that 

nP{a-^Yh e •) ^ Wi-) - M-), (1-2) 

where fih is defined in (1.1). Alternatively, for each h>l, one can replace (a„) in (1.2) 
by a sequence (al^^) such that P(|Y/i| > al'''') ~ and then Ch = 1 in (1.2). However, 
for each > 1, an/ ci'n^ ~^ dh as n — > 00 for some positive constants d^. 



1.3. The upper tail dependence coefficient 



As a starting point for the definition of a measure of extremal dependence in a strictly 
stationary sequence, we consider the (upper) tail dependence coefficient. It is defined for 
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a two-dimensional vector {X, Y) with X ~Y a.s the Hmit (provided it exists) 

\iX,Y)= hm P{X>x\Y>x). 

x^oo 

Of course, A € [0, 1], and A = when X and Y are independent or asymptoticahy inde- 
pendent. The larger the A, the larger the extremal dependence in the vector {X,Y). We 
refer, for example, to the discussions in Ledford and Tawn [25] and Beirlant et al. [3] on 
the tail dependence coefficient. 

The tail dependence coefficient can also be applied to the pairs {Xo,Xh) of a one- 
dimensional strictly stationary time series. The collection of values X{Xo,Xh) contains 
useful information about the serial extremal dependence in the sequence (Xt). If one 
considers a real-valued, regularly varying sequence (Xt) with index a > 0, the definition 
of regular variation immediately ensures the existence of the quantities 

P{x-'{Xo, . . . e (1,00) X (0,oo)''-i X (l,oo)) 
A^Ao, A, j _ ^iim ^^^^ _ _ ^ ^^^^ e (1, oo) X (0, oo)'^) 

_ /ift+i((l,oo) X (0,oo)''~^ X (l,oo)) 
//,i+i((l,oo) X (0,00)'') 



1.4. The extremogram 

Now, let (Xi) be a strictly stationary, regularly varying sequence of M'^-valued random 

vectors. Consider two Borel sets A, B in R such that C = A B is bounded 

away from zero and i^h+iidC) =0. According to (1.2), the following limit exists: 

nP(a;^iXo e A,a-'Xh e B) ^ i^h+M x Wf'^'^ x B)^jABih). 
Note that if both A and B are bounded away from zero, then 

^cov(/{„-iXo6A}>^{a,T^x,.ei3}) = ri[P{a-'Xo G A^a-'X^ e B) - P{a-'X e A)P{a-'X e B) 

^ nP{a-^Xn e A, a-^Xh e B) ^ lAsih). 

Also, note that the strictly stationary bivariate time series (^{a"^XteA}'^{a"^Xtes}) 
limiting covariance matrix function given by 

^ 'iaaQi) -fAB{hy 

which has all non-negative components. Since T{h) is the limit of a sequence of covariance 
matrix functions, it must also be a covariance matrix function and hence a non-negative 
definite matrix- valued function; see Brockwell and Davis [8]. In particular, both (jAAih)) 
and (7bb(^)) arc non-negative definite functions and (jABih)) is a cross-covariance 
function and need not be symmetric in A and B. 
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Alternatively, for A and Ax B bounded away from zero and with i^i(A) > 0, one may 
consider 

P(a„ Xft e B|a„ Xo e v4) = > — = Pas (ft)- 



Then, since 

cov(-^{a-iXoeA}^-^{a-iX;,eA}) 
var(-f{a-ixGA}) 



' PAA(h), 



{pAA{h)) is the correlation fmiction of a stationary process. With the exception of A = _B, 
{pAB{h)) and the correlation function (with vi{A)vi{B) > 0) 

/T- 7- \ lAB{h) 

corr(ir„-iv c4\7-'/„-iv cRT-j^" — rie£, 
V {a„ XoeA}' {a„ x,ei3}; ^z/i (yl)i.i (B) 

are, in general, different functions. However, for fixed A, all of these quantities arc pro- 
portional to each other. In what follows, we refer to any one of these limiting quantities, 
considered as a function of /i, as the extremogram of the sequence (Xt). Since A,B are 
arbitrary, there exist infinitely many cxtrcmograms. The sequence of the tail dependence 
coefficients of a regularly varying one-dimensional strictly stationary sequence (Xt) is a 
special case of the extremogram. Indeed, 



HXQ,Xh) — P(1,00),(1,00)(^)- 

As mentioned above, it can be interpreted as a particular ACF. 

Since ^aa can be interpreted as an autocovariance function, one can translate various 
notions from classical time series analysis to the extremogram. For example, one can 
introduce the analog of the spectral distribution corresponding to ^aa- In particular, if 
jAA is summablc, then there exists a spectral density and one may speak of short range 
dependence in the time series context. Alternatively, if 7^1^! is not summable, then one 
can talk of long range dependence. 

The bivariate extremal dependence measure "/ab introduced above can be extended 
in such a way that any finite number of events Ai, . . . ,Ah is involved. Provided the set 
C ^ Ai X ■■■ X Afi is bounded away from zero in Rq a-^d Vh{dC) = 0, one can define the 
limiting dependence measure 

nP(a,7^Xi Ai, . . . ,a-^^h Ah) ^ Vh{Ai x ••• x A,,). 

Such quantities can be of interest, for example, when considering the limits of conditional 
probabilities of the form 



F(a„ ^Xa e A2, . . . , a„ ^Xft e A,>„ ^Xi e Ai), 



(1.3) 
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where Ai is bounded away from zero. Probabilities of this form and their hmits appear 
as the extremal index and the cluster probability distribution of strictly stationary se- 
quences; see also Fasen et al. [16] who consider a generalization of the tail dependence 
coefficient. In this paper, we focus on the two-dimensional case, that is, the extremogram, 
but, in a sense, the extremogram also covers this more general case. Indeed, if we define 
the strictly stationary process Y/j = vec(Xi, . . . ,X;i), then (1.3) can be written in the 
form 

P{a-^Yh eW^xA2X---x Ah\Yi £ l'*^'''^^ x ^i) 

whose limit is an extremogram. 

The paper is organized as follows. In Section 2, we consider some well-known time 
series models, including the GARCH and SV models, and discuss conditions under which 
they constitute a regularly varying sequence. The extremograms are also computed for 
these models. In Section 3, we study estimators of the extremogram. Assuming that the 
sequence (Xf) meets certain dependence conditions such as a-mixing with a suitable 
rate, we show that these estimators are asymptotically unbiased and satisfy a central 
limit theorem. In Section 4, we apply the asymptotic results to GARCH and SV models. 
The Fourier transform of the extremogram can be viewed as the analog of the spectral 
density of a correlogram. The periodogram is similarly defined as the Fourier transform of 
the estimated extremogram. In Section 5, we show that the periodogram is asymptotically 
unbiased for the spectral density. A lag window estimate of the spectral density is also 
formulated and shown to be asymptotically unbiased and consistent. The proof of the 
main theorem in Section 3 is provided in Section 6. 



2. Examples of extremograms 
2.1. Preliminaries on regular variation 

Wc will often make use of a multivariate version of a result of Brciman [7] which can be 
found in Basrak et al. [2]. Assume that the d-dimensional vector X is regularly varying 
with index a and limiting measure that is, and fih in (1-1) are replaced by X 
and fi, respectively. Let A be a random k x d matrix that is independent of X with 
A||"+" < oo for some e > 0. Then, 

where A refers to vague convergence in Mq . 



2.2. The stochastic volatility model 

We consider a stochastic volatility model 

Xt = (JtZt, 
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where the volatihty sequence (at) constitutes a strictly stationary sequence of non- 
negative random variables, independent of the i.i.d. sequence (Zt). We further assume 
that Z is regularly varying with index a > 0, that is, the limits 

r PiZ>x) P{Z<-x) 
p = lim „, 7 and q = lim — -; — ; r 

^ x^o^P{\Z\>x) ^ x^oo P{\Z\>X) 

exist for some p, <? > with p + q = l and P{\Z\> x) = x^"'L{x) for some slowly varying 
function L. If we also assume that i?((T"+'^) < oo for some e > 0, then [Xt) is regularly 
varying with index a. This follows from Brciman's result (2.1); sec Davis and Mikosch 
[12]. Hence, the finite-dimensional distributions of {Xt) are regularly varying with index 
a and the limiting measures Vh in (1.2) are concentrated at the axes, as in the case of 
an i.i.d. sequence. Equivalently, the corresponding spectral measures are concentrated at 
the intersection of the unit sphere with the axes. To be precise (see [12]), 

h 

Vh+i{dxo, dxh) = ^ \a{dx.i)'^e(i{dxj), 

where \a{x,oo\ ~ px^"' and Ac([— oo, —x] = qx~°' , a; > 0, and Sy denotes Dirac measure 

at y. In particular, if ft, > 1 and C = A x M. x_Bis bounded away from zero with 
Vh+i{dC) =0, then 

{\a{A), if G S, A is bounded away from zero, 
\a{B), if e A, i? is bounded away from zero, 
0, if A and B arc both bounded away from zero. 

This means that jAsih) is zero unless cither A or B contains zero. Moreover, pAsih) ~ 1 
if G i? and pAsih) = otherwise. In particular, ^Asih) does not depend on h for h>l. 

If both A and B contain 0, then the set A x Mg ^ x i? is not bounded away from zero. 



2.3. GARCH process 

The regular variation of a GARCH(p, q) process was shown in Basrak et al. [2] under 
general conditions. Here, we focus on a GARCH(1,1) process because the calculations 
can be made explicit; this is not possible for a general GARCH process. A GARCH(1, 1) 
process is given by the equations 

Xt = atZt, tez, 
where (Zt) is an i.i.d. sequence with EZ = and var(Z) = 1, and 



Ct = aiZ2 + /3i. (2.2) 
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The parameters ao,ai,/3i > are chosen such that (Xt) is strictly stationary and the 
unique positive solution to the equation 

EC"^^ = 1 (2.3) 

exists. Then, under regularity conditions such as the existence of a positive density of Z 
on M, the sequences {at) and (Xt) are regularly varying with index a. This follows from 
theory developed by Kesten [23]. Equation (2.3) has a positive solution if Z is standard 
normal or Student distributed. We refer to Mikosch and Starica [26], Theorem 2.6, for 
details in the GARCH(1, 1) case. 

We now calculate the extremogram jab for the sets A ~ (a, oo), B = {b, oo) for positive 
a, b. For more general sets, the calculations become less tractable. We will make repeated 
use of the following auxiliary result whose proof can be found in Mikosch and Starica 
[26] and Basrak et al. [2]. 

Lemma 2.1. Assume that the strictly stationary GARCH(1, 1) process (Xt) satisfies the 
conditions of Theorem 2.6 in Mikosch and Starica [26]. Then, {Xt) is regularly varying 
with index a > given as the solution to (2.3) and the following relations hold for any 
h>2: 

{al ...,al)^ a^oiCo, CiCq, . . . , Ch-i • • • Co) + Ri, (2.4) 
{Xl ...,Xl)^ al{ZlzlC^,. . . , ZlCh-i • • • Ci) + R2, (2.5) 

where, for any e > 0, nP(n^^/"|Ri| > e) 0, i = 1, 2. 

It follows from Lemma 2.1 and Breiman's result (2.1) that 

P{x-^Xl e A,x-^Xl e B) P{x-^alZlla > l^x-^a^Co ■ --Ch-iZl/b > 1) 



P{x-^X^€A) P{X^/a>x) 

E{mm{Z^/a, Cp--- Cu-iZl/b))"'^ 
E{Z^/a)'^/^ 



PAsih). 



It is, in general, not possible to obtain more explicit expressions for pab- In the ARCII(l) 
case, that is, when /3i = 0, we can use (2.3) to obtain 

E{i-nm{Co,Co---C^a/b))-/^ • n n luw^'^ 

PAB{h) = ^CW2 = E{imn{l,Co---Ch-ia/b)) . 

The right-hand side decays to zero at an exponential rate. This can also be seen from the 
following calculations in the GARCII(1,1) case. There exists some constant c> such 
that 

PAB{h) < cE{nim{aiZQ, Co • • • Ch-i{aiZl)))"^^ 
<c£;(min(Co,Co---C„))"/' 
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cP{Co ■ ■ ■ Ch-i > 1) + E{Co ■ ■ ■ Ch-iT/^I^ 



{Co---Cft_i<l}- 



Choose K g (0,a/2). Since the function r EC^ is convex and (2.3) holds, EC^ < 1. 
Then, by Markov's inequahty, P{Co ■ ■ ■ Ch-i > 1) < (EC)'' and 



Hence, pAB{h) < c{EC'^)^, which imphes that pAsih) decays to zero exponentially fast. 
In [13], we give some further examples of extremograms for a GARCH(1, 1) process. 

2.4. Symmetric a-stable processes 

Let (Xt) be a strictly stationary symmetric a-stable (sas) sequence with integral repre- 
sentation 



where (ft) is a sequence of deterministic functions such that ft G L"{E,£,m) for some 
a G (0, 2), £ is a a-field on E and m is a measure on £. The measure m is the control 
measure of the sas random measure M on E. For the definition of a-stable integrals 
of type (2.6), we refer to Samorodnitsky and Taqqu [37]. Conditions for stationarity 
of the sequence {Xt) were given by Rosihski [34]. By the definition of an sas integral 
and stationarity of {Xt), for some constant Cq > 0, the tail of the marginal distribution 
satisfies 



The next result follows from Samorodnitsky [36]; also see Theorem 3.5.6 in Samorodnitsky 
and Taqqu [37] or Theorem 8.8.18 in Embrechts et al. [15]. We have, for A = (a,oo), 
B = (6,oo), a,fe > 0, 



E({Co ■ ■ ■ Q.-i)"/'/{Co...c._,<i}) < E{Co ■ ■ ■ Ch-iT = {EC-f. 




(2.6) 




P[x 



^Xh>b,x-^Xo>a) ^ P{x-^ mui{Xo/a,Xh/b) > 1) 
P{x-^X>a) ^ P{x-^X>a) 



/^[(min(/+,/+(a/6)))" + {minjf,- ja/bWjdm 
L I/oh dm 

PAsih). 



If we choose E — M., m to be Lcbesgue measure on R and 



ft[x)=e 
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then the correspondmg process {Xt)t£z is the discrete version of an sas Ornstein- 
Uhlenbeck process and 

fO e^"^min(l,e-^"^(a/6)")da; , , 

PAB{h) = ^-^ 112^ = min(l,e-^""(a/&)"), h>0. 

f e^"^ dx 

J —oo 

For a = 2 and a/b— 1, this autocorrelation function coincides with the autocorrelation 
function of a Gaussian AR(1) process. 
If we assume that 

and / is constant on the intervals {n — 1,77.] for all n e Z, then {Xt) is a linear process 
with i.i.d. sas noise. In this case, 

oo 

PAB{h)^ J2 ((min([/(n)] + ,[/(/i + n)] + (a/6)))" 

n— — OO 

/oo 



n— — oo 



2.5. ARMA process 

The cxtremogram for an ARMA process generated by heavy-tailed noise can be derived 
directly from the previous example. Suppose that (Xt) satisfies the ARMA(p, q) recur- 
sions 

Xt = (j)iXt-i H h 4>pXt-p + Zt + 8iZt-i H h 9qZt-q, 

where the autoregressive polynomial = 1 — (t>iz — • • • — (f>pzP has no zeros inside or on 
the unit circle and (Zt) is an i.i.d. sequence of symmetric and regularly varying random 
variables. Then, (Xt) has the causal representation 

CXD 

Xt=^ipjZt-j, 

3=0 

where the coefficients ipj arc found from the relation (j){z) J^'jLo V'j^"' (1 + ^i-^ + • • • + 
Oqz'^) (see Brockwcll and Davis [8]). From the previous example with the same sets A 
and B, the extremogram is given by 

E,^o((niin(^;,V'++Ja/&)))" + (min(V'-,V'-+Ja/&)))") 
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In particular, if (Xt) is an AR(1) process with (j>i E (0, 1), then ipj = ip-[ and 

PAB(/i) = min(l,0f (a/6)"). 

Regardless of the values of a and b, the extremogram eventually decays at a geometric 
rate. It is worth noting that for the case a > b, the extremogram may be equal to one 
for several lags before beginning its exponential descent. If we assume that a = b = 1 and 
(j) e (—1,0), then we get 

PAA{2h + 1)=0 and pAA{2h) ^ \^\"^'' . 

This means that an AR(1) process with a negative coefficient has as an alternating 
extremogram paa that is zero for all odd lags and decays geometrically for even lags. 
In this case, the extremogram coincides with the ACF of an AR(2) process with lag-1 
coefficient equal to and lag-2 coefficient equal to Based on the empirical estimate 

of the extremogram, AR-type behavior with a negative parameter cf) can be observed for 
foreign exchange rate high frequency data. See Figure 1 for an illustration. 

3. Consistency and a central limit theory for the 
empirical extremogram 

The aim of this section is to derive relevant asymptotics for the empirical extremogram. In 
Sections 3.1 and 3.2, we establish key large-sample properties for the empirical estimator 
of p{C). Based on these results, the asymptotic normality for the empirical extremogram 
is established in Section 3.3. 

Throughout this section, it is assumed that (Xt) is a strictly stationary, regularly 
varying sequence with index a > 0. The vector X = Xq assumes values in M!^ and has 
limiting measure /i. This means that we replace with X and ph with /i in the definition 
of (1.1). 

The empirical extremogram, defined in Section 3.3, can be viewed as a ratio of estimates 
of ii{A) and p{B) for two suitably chosen sets A and B. We first consider estimates of 

n{C), where C is a generic subset of Rq, bounded away from zero and with p{dC) — 0. 
Then, in particular, 

there exists e>0 such that CclxeK"^: |x| > e}. (3.1) 
A natural estimator of /i(C) is given by 

n 

Pr,i{C) = -^^I{Xt/a^eC}, 
t=l 

where (a„) is chosen such that P(|X| > a„) ^ n~^, rii = ?t!,„ — > oo and irin/n = o(l). 
These conditions on (m„) ensure consistency of Pm(C); sec Theorem 3.1. The estimator 

Pm{C) is closely related to the tail empirical process. We refer to the recent monographs 
de Haan and Ferreira [19], Resnick [33] and the references cited therein. 
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Figure 1. The empirical extremogram with A = B = (l,oo) U (oo,— 1) (upper- left) and 
A = B = (1,00) (upper-right) of a sample of five-minute return data of the foreign exchange 
rate USD-DEM. The middle consists of the extremogram of the residuals from a fitted AR(18) 
model and from the residuals of a GARCH fitted to the residuals A = B = {1, 00). The bottom 
shows the extremogram of a simulated AR-GARCH model. See Section 3.4 for more details. 
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We will work under the following mixing/dependence conditions on the sequence (Xj): 

(M) The sequence (Xi) is a-mixing with rate function (at). Moreover, there exist m„, 
r„ — >■ oo with m„ /n — >■ and rn/run — >■ such that 

oo 

lim m„ y ah ~0 (3.2) 

h—Vji 

and, for all e > 0, 

lim linisupm„ ^ F(|X/i| >ea,„,|Xo| >ea„i)=0. (3.3) 

h—k 

Condition (3.3) is similar in spirit to condition (2.8) used in Davis and Hsing [10] 
for establishing convergence of a sequence of point processes to a limiting cluster point 
process. It is much weaker than the anti-clustering condition D'{ea„) of Leadbetter, which 
is well known in the extreme value literature; see Leadbetter et al. [24] or Embrechts et 
al. [15]. Condition (3.3) is equivalent to 

lim limsup^P(|X;i| > ea„i||Xo| > ea„J = 0. (3.4) 

h—k 

There are various time series models that are a-mixing with geometric rate and for which 
(3.2) and (3.3) are easily verified. These include GARCH, stochastic volatility and ARMA 
models under suitable conditions on the noise; see the discussion in Sections 4.1 and 4.2 
for GARCH and SV models. 



3.1. Asymptotic mean and variance 



In this section, we calculate the asymptotic mean and variance of P,„(C) under condition 
(M). 



Theorem 3.1. Assume that (Xt) is a regularly varying, strictly stationary -valued 
sequence 



sequence with index a > m the sense of (1.1 ). Moreover, let C and C x Mg^ ^ x Cc 



ii-iQ he continuity sets with respect to /i and Hh+i for h>l, respectively, and let C 
he hounded away from zero. If condition (M) holds, then 



EP,n{C) ^ fl{C), 



var(P„,(C)) ^ ^ 



2 5]r,(C) 

h=l 



(3.5) 
(3.6) 



where Th{C) = ^h+i{C x Mq*"'' x C). If ^i{C) = 0, then (3.6) is interpreted as 
var(Pm(C)) = o(m„/n). In particular, we have Pm{C) — fJ-iC). 
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Proof. In what follows, it will be convenient to write 

^'m(C) = mP(X/a,„ e C) = mpo and = P(Xs/a„i € C, Xt/a,„ G C). 
Regular variation of X and strict stationarity of (Xf) imply that 
EP^{C) = P^{C) ^i{C) asn^oo. 

This proves (3.5). 

Turning to (3.6), we first note that 

s 2 / \ 2 n-1 



var(P™(C))= V^j '^var(/{x/a„GC}) + 2|^-^j - ^i) cov(/{Xo/a„ec} J{x^/a„ 

= h+l2- 



By regular variation of X, 



n 

2m„ 



I, = !^[P„(C)(1 -p„)] ^ ^KC). (3.7) 

We have, for k>l fixed, 

(A; r^i n— 1 \ 

+ (i-v^)[m-Po] 

= ^21 + -^22 + -^23- 

The regular variation of (Xt) implies that I21 '''h{C), so it suffices to show that 

lim limsup(|/22| + I/23I) = 0. (3.8) 

Since C is bounded away from zero, (3.1) liolds. Then, since =o(m^), 
I22 = ^ poh+o{l) 

h=k+l 

< TO„ ^ P(|X,i| > ea„, |Xo| > ea„i) +0(1). 

/i=A;+l 

We conclude from (3.3) that limfe^oo fiinsup„_^Q^ /22 =0. Finally, since (Xt) is a-mixing 
and condition (3.2) holds. 



lim I/23I < lim m„ a/i = 0, 

/i=r„ + l 

which completes the proof of the theorem. □ 
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3.2. A central limit theorem for Pm{C) and the empirical 
extremogram 

The following central limit theorem is the main result of this paper. 

Theorem 3.2. Assume that the conditions of Theorem 3.1 hold with kn = n/m„, m„ 
and r„ satisfying knUr,^ — > 0, and rrin = o{n^^^). Then, the central limit theorem 

\ 1/2 



nir, 



^» = ( — ) (^) - G C)] 

1/2 n 



holds, where 

oo 

a2(C)=M(C^) + 2ET^(C). (3.9) 
/i=i 

The condition m„ = 0(71^/'^) can be replaced by the condition 

^a^^O and ^ 0, (3.10) 

which is often much weaker. 



4 



The proof of the theorem is given in Section 6. It is based on a standard big-block/small- 
block argument. Proofs in a similar vein in an extreme value theory context can be found 
in the literature; see, for example, Rootzen et al. [35]. In Section 6, we also propose an 
estimator of the asymptotic variance (t^(C). 

For many examples considered in financial time series and elsewhere, the a-mixing rate 
function aj decays at an exponential rate. In these cases, one can take m„ ^ n^/'^~^ for 
some small (5 > and r„ ~ n^^^ . The choice r„ ^ clogn for some c > also fulfills (3.10). 

A slight adaptation of the proofs given in Theorems 3.1 and 3.2, in combination with 
the central limit theorem in Utev [38], shows that these results hold if condition (M) is 
replaced by the assumption that the process is (/)-mixing with a summable rate function 

A related paper on the prc-asymptotic behavior of the empirical extremogram in the 
case A = (x, 00) and B = {y, 00) is Hill [20]; see, in particular, his Theorem 5.4. In contrast 
to the present paper, he does "not require a model for the bivariatc joint tail nor any 
assumptions concerning the joint tail" (his Remark 15). 
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3.3. Extremogram estimation 

In order to derive the limiting distribution of the extremogram estimator, we first consider 
the large-sample behavior of the ratio estimator of 



R{C,D) 

given by 

Rrn{C,D): 



Pm{C) ' 



where C and D are sets of the type described in Theorem 3.1 with /i(C) > 0. Under 
the conditions of this theorem, Rm{C,D) is a consistent estimator of R{C,D). In what 
follows, we study the central limit theorem for this ratio estimator. 
Observe that 

R,n{C,D)-R{C,D) 

_ P,n{D)KC)-P,n{C)fi{D) 

= i|±^[(/.(C)(P,„p) - EPUD)) KD)iPrn{C) EPUC))) 
+ ifi{C)EP^{D) - fi{D)EP„,{C))]. 

The decomposition (3.11) indicates how we have to proceed. First, we must prove a 
central limit theorem for the first term on the right-hand side. This problem is similar 
to Theorem 3.2 and requires proving a joint central limit theorem for {P„i{C), Pm{D)). 
For the second term in (3.11), we have, by (3.5), 

^iiC)EPra{D) - n{D)EP„,{C) 

(3.12) 

= m„[^l{D)P{a-„^X £C)- fi{C)P{a;^'X e D)] = o(l). 

However, for a central limit theorem for {n/mny^^{Rm{C,D) — R{C,D)), one needs to 
know the rate of convergence of (3.12) to zero. This is, in general, a difficult problem 
which can sometimes be solved when one deals with a specific time series model; see, 
for example. Section 4.2 in the case of a stochastic volatility model. Alternatively, one 
could assume conditions on the rate of convergence in the relations Pm{D) — > ^{D) and 
Prn{C) — > ^ji.{C). Such conditions are common in extreme value theory. 

We formulate the central limit theorem for the finite-dimensional distributions of the 
ratio estimator in the following corollary. 
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Corollary 3.3. Assume the conditions of Theorems 3.1 and 3.2 hold for the sets^ 
Di, . . . , Dh,C and the sequence (Xf). Moreover, let fJ.{C) >0. Then, 



\ 1/2 

— A) - Rm{C, A)]»=i,...,/. ^ N{0, (/x(C))"4f'SF), 

where 

and where S anrf F are defined in (3.15) and (3.18), respectively. If, in addition, 

lim V^[^(A)-P(a^iX e C) - ^(C)P(a^iX e A)] =0, ^ = 1, . . . , /z, (3.13) 

\ 1/2 



[R^{C, D^j - A)].=i,...,h 4 iV(0, (/z(C))-*F'SF). 
Proof. In order to ease notation, we set Dh^i = C. We show the central limit theorem 

^ / /c,(X,/a„0-^'fc'Xe A) 

^) E ; |4iV(0,S), (3.14) 



where 



A;,+i (Xt/a„) - P(a„^X e Ai+i) . 
/ cr2(£)i) rr,,,_D3 ?'-Di,D3 ■■■ fD.Mu^ 



S = 



(3.15) 



and, for i ^ j , 

oo 

- ^i{D^ n A) + 2^ Mft+i(A X (Mo)''"' X A)- (3-16) 



By the Cramer- Wold device, it suffices to show the central limit theorem for any linear 
combination 

z'S„ 4Ar(0,z'Sz), ze]R''+^ 



iThis means that the set C in Theorems 3.1 and 3.2 has to be replaced by the Di's. 
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The same ideas as in the proof of Theorem 3.2 show that it suffices to prove the central 
hmit theorem for k„ i.i.d. copies of 



nin h+1 



By the central limit theorem for triangular arrays, one needs to verify that for every 
e>0, 



(3.17) 



fc„i?(r2(z)/{|T„(z)|>.})^0. 

This follows from Markov's inequality and (6.4) when 7n„ = o{n^/^): 

K.ET^iz)I{\T„i.)\>e} < cm2p(|T„(z)| > e) < cm^ fc-^ = o(l). 

If the conditions (3.10) are met, then the argument at the end of the proof given in 
Section 6 can be used to establish (3.17). This proves the central limit theorem (3.14). 
We observe that 

Rm{Dh+l,Di) — Rm{Dh+l,Di) 
l+Op(l) 



{{PmiD,) - P^{D,))fl{Dh+l) - {PraiDh+l) - (A.+i))m( A))- 



Hence, 



n 

rrir, 



1/2 



[Rm{Dh+l, Dl) — RmiDh+1, Dl)]i=l,...,h = , ^^nnq ^S^ 



iv(o,(/i(7?^+i))-4r'SF), 



where 



/Ai(A>+i) 




V 
This proves the result. 




/i(A.+i) 



-A^(A) 



(3.18) 



••• [i[Ph+i) -ii[Dh)l 



□ 



Recall that, for subsets A and B of Mg that are bounded away from zero and y^idA) 
liidE) = 0, /^(A) > 0, the extremogram at lag h is defined by 



PAsih) = lim 



P{a-'XoeA,a-^XheB) 
Pian'Xo e A) 
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A natural estimator of pab is the empirical extremogram defined by 

En— i Y 

PAB{i) = ^=^75 f , 1=0,1,.... 

This estimate can be recast as a ratio estimator by introducing the vector process 

Yt = vec(Xt,...,Xt+ft) 

consisting of stacking h+1 consecutive values of the time series (Xt). Now, the sets 

C and Dq,. . . ,Dh specified in Corollary 3.3 are defined via the relations C = A xM.q , 

Do^AnBx Mq'' and = A x 1^^*"^^ x B x W^f''^'^ for i > 1. With this conversion, 
Corollary 3.3 can be applied to the (Yt) sequence directly to show that pAB{i), centered 
by the pre- asymptotic value of the extremogram defined by 

P(a,~^Xo£ A,a,-iX, £g) 

PAB,rn{i) = _i , (3.19) 

F[am X G ^) 

is asymptotically normal. On the other hand, if the bias condition (3.13) is met, then one 
can center the empirical extremogram by its true value and still retain the asymptotic 
normality. For completeness, we record these results as the following corollary. 

Corollary 3.4. Assume that the conditions of Corollary 3.3 are satisfied for the sequence 
(Yt). Then, 

\ 1/2 

— [pAB{i)~PAB,mm-o,i....,h^N{0,{fi{A))-''F"EF). (3.20) 
mn J 

Moreover, if (3.13) is satisfied, then (3.20) holds with pAB,m{i) replaced by pAB(i)- 



3.4. An empirical example 



For illustrative purposes, we compute the extremogram for the high-frequency financial 
time series consisting of 35135 five minute returns of the foreign exchange rate USD- 
DEM. The data, which was provided in processed form by Olsen and Associates (Ziirich) 
at their Second Conference on High Frequency Financial Data in 1995, was one of the first 
widely disseminated high frequency data sets. Aside from the choice of A and B, the most 
problematic issue in computing the extremogram is the selection of a suitable threshold 
am- The choice of threshold is always a thorny issue in extreme value theory, from 
estimating the tail index of regular variation to fitting a generalized Pareto distribution. 
We tried several different choices of threshold based on various empirical quantilcs of the 
absolute values of the data. Of course, there is the typical bias- variance trade-off in this 
selection with a large m corresponding to a smaller bias, but larger variance, and vice 
versa for a moderate value of m. In the upper- left panel of Figure 1, we plot the sample 
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extremogram for lags 1-100 using A = B = (— oo,— 1) U (l,c») with several choices of 
threshold a™ starting with the 0.97 empirical quantilc of the absolute values (solid line) 
and continuing with the 0.98, 0.99, 0.995 and 0.999 quantiles (dotted and dashed lines). 
With the exception of the 0.995 and 0.999 quantiles, the extremogram has roughly the 
same value for all lags. The extremogram based on the 0.995 quantile is also similar, at 
least for the first 38 lags, and drops off to zero due to a lack of pairs of lagged observations 
that exceed this large quantile. In view of this robustness of the plots for this range of 
thresholds, we will choose am to be the 0.98 quantile in all of the remaining extremogram 
plots. 

The extremogram of the returns with A = B = {l,oo) is displayed in the upper- right 
panel of Figure 1 . Note that the extremogram alternates between large values at even lags 
and smaller values at odd lags. Like many high frequency data sets, the autocorrelation 
function for this time series alternates between positive and negative values. To further 
investigate this alternating behavior of the extremogram, which may be due. in part, to 
an artifact of the processing of the data by Olsen, we fitted an AR model to the data. 
The best fitting AR model, based on minimizing the AICC, is of order 18. We then 
refined this model by selecting the best subset model which ended up having significant 
non-zero coefficients at lags 1, 2, 3, 5, 6, 7, 11, 13, 14, 16 and 18. The lag-1 coefficient, 
which was much larger than the other lags, was -0.6465. So, this alternating character of 
the extremogram is consistent with the extremogram of an AR(1) process with negative 
coefficient described in Section 2.5. In the middle-left panel, we plot the extremogram of 
the residuals from the subset AR(18) model fit with A = B = (l,oo). As a baseline, we 
have also plotted the horizontal line that one would expect for the extremogram if the 
data were in fact independent and the threshold was the 0.98 quantile of the absolute 
values. Note that the values are now significantly smaller than those for the returns. Some 
extremal dependence still remains in the residuals, at least for small lags. This behavior is 
an indication of the presence of nonlinearity in the data. In fact, the ACFs of the absolute 
values and squares of the residuals are highly significant. We were moderately successful 
in eliminating part of this nonlinearity by fitting a GARCH(1, 1) model with t-noise to 
the residuals. The extremogram of the GARCH residuals, which still exhibits some ACF 
in the absolute values but none in the squares, is displayed in the middle-right panel of 
Figure 1. Based on this extremogram, there is little remaining extremal dependence in the 
GARCH residuals. As a last check on this modeling exercise, we simulated a realization of 
the time series based on the fitted model. In other words, we generated a time series from 
the GARCH model and then passed it through the fitted AR filter. The extremogram of 
this simulated scries is displayed at the bottom of Figure 1. It displays similar features to 
the original extremogram (top-right panel of Figure 1), but the dependence is not quite 
as strong or persistent as for the original data. 

In Figure 2, we chose A = (1, oo) and B = (— oo, —1) for computing the extremograms 
for the return data (left) and the residuals from the AR(18) fit (right). For the return 
data, the extremogram at the first lag has a large positive value and alternates at the 
odd and even lags. On the other hand, for the residuals, there is a real difference in shape 
of the extremogram from that displayed in Figure 1. 
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USD-DEM 5 minute returns USD-DEM resittuala of sn AR(1S) motlel 



Figure 2. The empirical extremogram (left) of a sample of five- minute return data of the foreign 
exchange rate USD-DEM and the residuals (right) from a fitted AR(18) model with A = (1, cxj), 
B — (— oo, —1); see Section 3.4 for more information. 

4. Application to GARCH and SV models 
4.1. The GARCH process 

Assume the conditions of Section 2.3 hold for a regularly varying, strictly stationary 
GARCH(1,1) process with index a > 0. The GARCH process is /3-mixing, hence a- 
mixing, with geometric rate under general conditions on the noise; see Boussama [5] and 
Mokkadem [27]. In the GARCH(1, 1) case, these conditions hold provided the density of 
the noise variables Zt is positive in some neighborhood of the origin. 

In what follows, wc assume that (Xt) is a-mixing with a geometric rate function 
at < ca* for some a G (0, 1), c > 0. First, recall that the k chosen in Section 2.3 satisfies 
K e (0,Q!/2) and EC^ < 1, where C ~ aiZf + /3i. Second, the normalizing constants a„ 
are chosen such that f (|Ar| > a„) ~ n^^. In particular, a„ ~ cn^^". Now, select m„ = 
{d £ (0, 1)) and r„ = for 7 < min((l — 6)/3,S{2K/a)). With these choices of m„,r„, we 
verify conditions (M) and (3.10). The mixing condition (3.2) is straightforward to check 
since (Xt) is a-mixing with geometric rate. To check (3.3), it suffices to show (3.4). Using 
the recursion in (2.2), we have 

t 

a^=Cf-Cial+aoJ2Cf-a+i, t>0. (4.1) 

i=l 

If we assume, for the sake of simplicity, that e = 1, then (4.1) and Markov's inequality 
imply that 

P{\Xh\ >a™||A:o| >a™) 

< PiZlCh ■ ■ ■ C\a^ > alJ2\\Xo\ > a„0 + P (zlao ^Ch--- C,+i > af„/2 

V i=i 
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-P(l^o|> am) ^ 
= h{h) + h{h). 

With the prescribed choices of r„ and m„, we have 



h=k+l 

By again applying Markov's inequahty and Karamata's theorem (see Bingham et al. [4]), 

P(C„...M2>ay2) 



^ ''^^^^ ^ P{\X\>a^) 

h=k+l h=k+l ^' ' ™^ 

Appropriately combining the above facts shows that (3.3) is satisfied. 

Finally, it is straightforward to check (3.10) from the choices of 5 and 7. Hence, the 
conclusions of Theorems 3.1 and 3.2 apply to the GARCH(1,1) process. To date, we 
have not been able to verify the bias condition (3.13) of Corollary 3.4. One needs a more 
precise estimate than is currently known for the tail distribution of cr^; see Goldie [18] 
for results in this direction. 



4.2. The stochastic volatility process 

The stochastic volatility process {Xt) has regularly varying finite-dimensional distribu- 
tions with index a if the multiplicative noise (Zf) is regularly varying with index a and 
the volatility at has a moment of order a -f e, e > 0. In fact, for the following argument, 
we will assume that at has a finite 4Q!th moment. In addition, we assume that the mixing 
condition (3.2) is satisfied for {at). If the sequence {at) is a-mixing with rate function 
(of/i), then {Xt) has rate function {Aah), and hence {Xt) also satisfies (3.2). 

We next dispense with condition (3.3) with e = 1; the general case e > is completely 
analogous. Using the independence of {Zt) and {at), an application of Markov's inequality 
for p < a yields 

m„ £p(a-i|Xo| > l,a-i|X;,| > 1) 

h=k 

< TO„ ^ P(max((To,o-i,crh,o-/i+i)min(|2'o|, |Zi|, \Zh\, \Zh+i\) > a„) 

h=k 
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= irin g[P(max(go, ai , ah,ah+i)Z > am\{<Jt))f 

h=k 

<cmnr^iEaP)^a-^P. 

Since r„ — o(to„), the right-hand side vanishes if p is chosen close to a. 

We now turn to the problem of verifying the bias condition (3.13) so that we can 
apply the limit theory of Corollaries 3.3 and 3.4 to the empirical extremogram. To this 
end, we assume, for convenience, that log (if is a stationary Gaussian sequence with 
mean zero and unit variance. Choose m„ = n'^ for some 7 £ (1/3,1) and suppose the 
mixing function decays sufficiently fast so that (3.10) holds. For example, if at decays 
geometrically, one could take r„ = (1 — 7)/4. If we choose the sets A = (1, 00) x (0, 00)'*^^ 
and S = (1, 00) X (0, oo)'*~^ x (1, 00), then fihiA) > and ^ih{B) = 0. Set s„ ~ for some 
0<S < (37 - l)/(4a) and note that 

P(a;;^Xo > l,a^,^Xh > 1) < P(max(cro,o'/i)min(Zo,Zi) > a^) 

< P(max((7o,o-/0 > s„) + P(min(Zo,Zi) > a,„/s„)- 

Since (m„n)^/^ — j^o. 5(1+7)^ g^j^ application of Markov's inequality yields that for any 
fc>0, 

(m„n)i/2p(max(ao,a^) > s„) < n''-^^^+^hE{a'')s-\ 

The right-hand side converges to zero for k sufficiently large. On the other hand, for any 
e>0, 

(m„n)V2[P(Z > a,„/s„)]2 < n°•5(l+^)(n^/"/"')"'"■'^ 

The right-hand side converges to zero for small e. This shows that (3.13) is satisfied 
for a stochastic volatility model and sets A, B as specified. Applying Corollary 3.4, we 
conclude that 

(n/m„)i/2[p^B(l) - Pab{1)] ^ N{Q, a^{B)fi-^A)), 

where, of course, pab{^) — in this case and cr^{B) = 0. Therefore, we get a degenerate 
limit for this choice of A and B. 

As a second example, let A = (l,oo) x (0,cx)) and B = {{xi,X2)- L < xi — X2 < U, 
xi >0,X2> 0}, where L <1 <U. With Xt = {Xt,Xt+i), a straightforward calculation 
shows that fi{A) = 1 and /i(-B) = L^" — U^". Since the measure /i only concentrates on 
the two coordinate axes. An B intersects the a;2-axis in the empty set and intersects the 
xi-axis in the interval [1,C/]- Hence, 7as(0) = 1 — U~°'. Since the limiting measure of 
(Xq, Xi, Xfn Xh+i) concentrates on the four coordinate axes, 

nP{a-^Xo e A, a'^Xh E B) ^ lAsih) = 
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for h> 1. On the other hand, {{xi,X2,X3): {xi,X2) € B and {x2,X3) £ B} has empty 
intersection with the three coordinate axes and hence 7s_b(1) = 0. More generally, we 
have 

Using Corollary 3.3, we have 

(n/m„)i/2(p^^(0) - pAB.mm ^ N{Q, 1 - [/-"). 



5. Some spectral analysis 

The extremogram "fee = {Th{C)) with Th{C) = T-h and to(C) = fJ.{C) as defined in The- 
orem 3.1 is an asymptotic covariance function. If it is summable, then the function 

oo 

/(A) = Y,rh{Cy''^ = ^i{C) + 2^cos(A/i)r^(C), A S [0,7t], 



h=l 



defines the corresponding spectral density which, in turn, determines jcc- The sample 
version of the spectral density / is given by the periodogram 



t=l 



,it\ 



%{0)+2j2cos{Xh)%{h), Ae [0,7t], 



h=l 



where 



It = /{Xt /a,„ ec} and h = h - Elt = /t - _po , 



and 



n—h 



n—h 



%{h) = — S^Itlt+h, ^n{h) = — y^Itlt+h, h>0, 
t=i t=i 

are analogs of the sample autocovariance function of a stationary sequence. Since it is 
common to evaluate the quantities /„c(A) at the Fourier frequencies A = Afe = 2nk/n G 
(0,7t) and J2t=i ^' '^^'^ ^^"^ define Inci^k) with the it's replaced by the /^'s which 

do not contain the unknown probability po- However, for the calculations which involve 
mixing conditions, it is crucial to use the given definition of IncW with the centered 
quantities It- 

In what follows, we will mostly deal with the lag-window estimator (see Brockwell and 
Davis [8]) or the truncated periodogram fncW defined by 



^c(A)=7n(0) + 2£cos(A/i)7„(/i), Ae [0,7t], 



h=l 
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where r„ — >■ oo and r„/m„ — > as n — > oo has the same interpretation as in the previ- 
ous sections. Truncated estimators of the form fnc are commonly used in the spectral 
analysis of stationary time series; see, for example, BrockwcU and Davis [8] and Priest- 
ley [29]. A major reason for this is that, unlike the periodogram of a stationary time 
scries, the truncated periodogram fnc is a consistent estimator of the spectral density. 
In our setting, we show below that fnc remains a consistent estimator of /(A). Based on 
background calculations, it appears that IncW ^ot consistent. 



Theorem 5.1. Assume the mixing condition (M) for the regularly varying, strictly sta- 

tionary sequence (Xj) with index a > and that the products C C Kg are continuity 
sets with respect to the limiting measures jjLk, fc = l,2,..., occurring in the definition of 
regular variation. Then, 

lim EI,,c{\) = lim /„c(A) = /(A), A e (0,7t). (5.1) 

n— J-oo n— ^oo 

In addition, if rrinr'^ = 0(^'^)7 then we also have 

lim i?[(/„c(A)-/(A))V0, Ae(0,7t). (5.2) 

n— >oo 

This means that the estimator /„c(A) of the spectral density /(A) is asymptotically un- 
biased and mean-square consistent. 

The rate of convergence in (5.2) cannot be derived unless one assumes conditions 
similar to (3.13). 

Proof. In what follows, it will be convenient to use the notation 

= -P(X/a™ e C), pst^ P{^s/am^C,^t/ayn^C), 

Ist=lslt, Pst=EIst, 

We will exploit the following auxiliary result. The proof is completely analogous to the 
proof of Theorem 3.1 and is therefore omitted. 

Lemma 5.2. Under the conditions of Theorem 5.1, 
E%{h)^^n{h)^Tu{G), 



var(7„(/i)) var(7„(/i)) - 



t=i 



h>Q, 



where, for h>0 and t>0, TQht,t+h{C) = lim„^oo mpoht,t+h- 
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We start by considering the expectation of the periodogram. We have, for fixed A: > 1, 



EIncW = iTT-nPo + 2m„ ^(1 - n ^h)cos{Xh)po 

h=l 
n-1 

= m„po + 2m„ ^(1 -n^^h) cos{Xh)[poh - Pol 

h=l 

k 

= /i(C) + 2^cos(A/i)T,,(C) 



h=l 



n-l 



\h=k+l /i=r„+l/ 



Using condition (M), 



lim UmsupTO„ 



Km m„ 



h=k+l 
n-l 



Y ^h)cQs{h\)[pf)h-vl] 

h=r„ + l 



< Um hmsupm„ Poh = 0, 

k ^ OO 7-1. — i-rv^ 

h=k+i 

00 

< UmsupTO„ a/i = 0. 



/i=r„ + l 



The relation hin„_>.oo EfncW — fW is derived in the same way. 
We conclude from Lemma 5.2 that for any fc > 1, 

var ^(7„ (0) - ) + 2 ^ (^n (h) ~ ThiC))^ ^ 0. 

Therefore, it suffices for (5.2) to show that 



lim lim sup var > cos(Aft,)7„(/i) =0. 
It suffices to bound the expression 

^ = E E I cov(7„(M,7n(0)i < 5" E E E E I cov(/,,,+,, 



h—k l—k 



h=k l=k t=l s=l 



We have 



^ ^ 2^ E E E I COvilohJr.r+l) 



h^k l^k r^O 
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h=fc /=A: r=0 



n 



/i=fc l=k r=2r„ 



2 ^Ti 



<4^EE[^'o'^+Po' + 2l'o]+2 



E 



< 



TTi>fir>fi 



h^k 



3 2 
TOnPo) PO — +2 ^ 

n n 



r—mn—rji 



Condition (M) and the growth restrictions m„r^ = 0(n) yield the desired result 

lim linisup/ = 0. 



6. Proof of Theorem 3.2 
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□ 



Proof. We use the same notation as in Section 5 and write 

Ynt = ('7i„/n)^/2(/{Xt/a„GC} - Po) = {mn/nf^Jt, t^l,...,7i. 

In order to prove the result, we will use the technique of small/large blocks which is well 
known in the asymptotic theory for sums of dependent random variables. For simplicity, 
we will assume that n/m„ = fc„ is an integer. The non- integer case does not present any 
additional difficulties, but requires additional bookkeeping. We introduce the index sets 



Ini = {{i - ^)mn + 1, 



. , im 



ri}j 



i 1 , . . . , . 



By Ini, we denote the index set which consists of all elements of Ini but the first r„ 
elements and we also write Jni = Ini \ Ini- Since rn/rrin — >■ and m„ — >■ oo, the sets /„, 
are non-empty for large n. For any index set B of the integers, we write 



SniB) 



- E ■ 

jeB 



We first show that 



var[ y^^SnjJni) 

We have 

var|^^S'„(J„()^ < fcn var(S'„(J„i)) + 2fc„ ^ | cov(S'„( J„i), S'„(J„ji+i))| 
= Pi+P2. 



0. 



h=l 
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We observe that, by (3.3) and since rn/run 0, 



Pi 



iPo + 2 ^ (r„ - h)pQ,i 

h=l 

<2(r„/m„)m„ ^ poh + o(l) = o(l). 

h=k+l 

Moreover, for positive constants c, 

^^-E E E --(^-^^) ^E E E 

oo c» 

< cr„ ^ a?i<cm„ ^ a/i = o(l). 
This proves (6.1). 

Condition (6.1) iniphcs that Sn and X]f=i "^"(-^"i) have the same hmit distribution, 
provided such a hmit exists. Let Sn{Ini), i = 1, . . . , fc„, be i.i.d. copies of Sn{Ini)- In what 
follows, we use a classical idea due to Bernstein dating back to the 1920s. Iterated use of 
the definition of a-mixing and standard results for strong mixing sequences (see Doukhan 
[14]) yield, for any t e M, 



£;]-[eits„(7„,)_^]-[e^ts„(/„ 



1=1 



i-i 



k„ 



s=l+l 



^E 



1=1 s=l 

l-l 



(6.2) 



s=l s=l+l 



< knAUr 



By assumption, the right-hand side converges to zero as n — > oo. Hence, Yl\li ^n{Ini) 
and X]f=i'^n(^"0 have the same limits in distribution (provided these limits exist). Let 
Sn{Ini), i = 1, . . . , fcn, bc au i.i.d sequence with the same distribution as Sn{Ini)- A similar 
relation as (6.1) ensures that it suffices to prove that 



i=l 



(6.3) 
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We first verify tliat 

var|^^^„(/„,)j =fc„var(^„(/„i))^a2(C). (6.4) 

We have 

A:„ var(^„(/„i)) = var^y^/f^ 

Tlln — l 

= m„ var(/o) + 2 ^ (m„ - h) cov(/o, /ft). 
/i=i 



By regular variation, 
Fix A:> 1. Then, 

/ k 



TO„var(/{Xt/a„ec}) ^m(C)- (6-5) 



51+ I] + J2 ](mn-h)cov{Io,h)^Ri + R2 + R3- (6.6) 

\h=l h=k+l h=r„ + l/ 

By the same argument as for (6.5), i?i — > ^-nd similar arguments as those 

for I22 and /23 in the proof of Theorem 3.1 show that (6.4) holds. 

We apply the central limit theorem for the triangular array of i.i.d. mean-zero random 
variables Sn{Ini), i = 1, . . . , fc„. By Gnedenko and Kolmogorov [17], Theorem 3, page 101, 
or Theorem 4.1 in Petrov [28], and since (6.4) holds, one needs to verify the following 
condition for any e > 0: 

A:„i?[(5„(/„i))2/^l^^(^^^^l^^j]=i?(^£7,j /^|-^^^^^j|^^^->0. (6.7) 

A trivial estimate of the right-hand side is given by 

cm2p(|5„(/„i)| > e) < ce-^m^ var(5„(/„i)) = 0{ml/n) = o(l). (6.8) 

Next, we show that (6.7) holds under the conditions (3.10). We have, by the Cauchy- 
Schwarz and Chebyshev inequalities and (6.4), with J„i ~ X]"="i ^t' ^^r constants c > 0, 

{EJll[\s„ii„,)\y,}f < EJiP{\Sn{Ini)\ > e) < cE J^^var{S,,{Ini)) = EJiO{mJn)(6.9) 
We focus on the fourth moment of the partial sum J,„, which can be written as 

EJm ~ Elstuv 
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5=1 S^t S<t 

= Ai + C1A2 + C2A3 + C3A4 + 0(1). 
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+ 0(1) 



s<t<u<v 



Now, 



EAi = ninElf < mnE\h\ < 2m„po = 0(1). 
Since /( has mean 0, we have 

\A2\-- 



E + E 

\s-t\<rn |s-t|>r,/ 

<2m„r„po+ E \E{U?) - EIsElf)\ 

\s-t\>rr, 

< 2m„r„po + 4m„ ^ aj = 0(r„). 



The third term can be deah with as follows: 

^3 = ( E + E )^(^?^' 

^|s-t|<r„ |s-i|>r„^ 



EI^EI?) 



\s-t\>rn 

< 2m„r„po + 4to„ ^ aj + 'im^^pl = 0(r„). 



We decompose the index set of the fourth term into four disjoint sets: 
Ki = {(s, t, u, u): l<s<t<u<v< m„, u — w > r„}; 
K2 = {(s, t, w, «)• ^<s<t<u<v< m„, V ~ u< Vn, u — t> r„}; 

— {{s,t,u,v): I < s <t < u < V < m„, u — u<?'n,w — t<r„,i — s>r„}; 
K4 = {(s, i, M, u): 1 < s < i < u < w < m„, w — u < rn,u — t < r„, t — s < r„}. 



We then obtain 



1^4 



E+E+E+EV^? 



< E «j + cm2r„ ^ a, + 0{rl) + m^rl ^ + 0(r3) 
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where the bounds for the sums in the penuhimate hne follow in the spirit of the arguments 
used to derive the orders for A2 and A3. 

Combining the bounds above with the bound (6.9), the conditions in (3.10) ensure 
that (6.7) is satisfied. This completes the proof. □ 

Relation (6.4) suggests the following estimator for o'^(C): 



1=1 

It is a consistent estimator of (C) , as the following calculations show. We have 

^n(C) = ^ S^{Ini) — ^S^. 
i=l 

Since fc~^5'^ = op(l), it suffices to show that 

1=1 

As for (6.2), we observe that for s > 0, 



1=1 



;=i 



< 4fc„ar„ -> 0. 



(6.10) 



On the other hand, we have proven that '£f=i Snilni) 4 N{0,a'^{C)). By Raikov's the- 
orem, this is equivalent to 



1=1 



Therefore, from (6.10), 



(6.11) 



;=i 
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Let (Wi) be an i.i.d. standard normal sequence, independent of (Xt). Then, (6.11) holds if 
and only if 



1=1 

On the other hand, 

k„ / 



1=1 \l=l J 



vi=l 1=1 



(6.12) 



By virtue of (6.1) , 

Y W,Sn{ Jm) = wAY ^niJm) ) ^ 
1=1 \i=l J 

and from (6.12), we conclude 
which proves that S?2(C)4cr2(c). 
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